THE AUTOMORPHISM GROUP OF THE GROUP OF 
UNITRIANGULAR MATRICES OVER A FIELD 



AYAN MAHALANOBIS 

Abstract. This paper finds the generators of the automorphism group of the 
group of unitriangular matrices over a field. Most of this paper is an exposition 
of the work of V.M. Levchuk, part of which is in Russian. Some proofs are my
own.



1. Introduction 

The automorphism group of the group of unitriangular matrices over a field was
studied by many authors [5]-[7]. In this direction, the first paper was in Russian,
published by Pavlov in 1953. Pavlov studies the automorphism group of unitriangular
matrices over a finite field of odd prime order. Weir [7] describes the automorphism
group of the group of unitriangular matrices over a finite field of odd
characteristic. Maginnis describes it for the field of two elements and finally
Levchuk describes the automorphism group of the group of unitriangular matrices
over an arbitrary ring. In this expository article, we will study the automorphism
group of the group of unitriangular matrices over an arbitrary field $F$.

To start, we denote the algebra of all lower niltriangular matrices over $F$, of size
$d$, by NT(d, F). This is the set of all matrices that have zeros on and above the
main diagonal, and arbitrary field elements (possibly non-zero) below the main
diagonal. It is known to be a nilpotent algebra: $M^d = 0$ for all $M \in$ NT(d, F),
where $0$ is the zero matrix of size $d$. The general method that we describe below
works only when $d$ is greater than 4. The cases $d = 3$ and $d = 4$ can be computed
by hand; this was done by Levchuk [5]. Henceforth, we assume that $d \geq 5$.

One can define two operations on the set NT(d, F).

: The first operation is $\circ$, defined as $a \circ b = a + b + ab$.
: The second operation is $*$, defined as $a * b = ab - ba$.

It is known that (NT(d, F), $\circ$) is isomorphic to UT(d, F), the group of (lower)
unitriangular matrices, the isomorphism being $x \mapsto 1 + x$, where $1$ is the
identity matrix of size $d$. This group is also known as the associated group of the
ring NT(d, F). In this paper, we will denote the associated group of NT(d, F) by
UT(d, F).
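The isomorphism $x \mapsto 1 + x$ can be checked numerically. The following is a minimal sketch, assuming the prime field $\mathbb{Z}_7$ stands in for $F$ and $d = 5$; these choices and all function names are illustrative and do not come from the paper. It tests $(1 + a)(1 + b) = 1 + (a \circ b)$ on random niltriangular matrices, and the nilpotency $M^d = 0$.

```python
import random

P = 7  # assumed prime; Z_P stands in for the field F
D = 5  # matrix size d


def identity():
    return [[int(i == j) for j in range(D)] for i in range(D)]


def add(a, b):
    return [[(s + t) % P for s, t in zip(ra, rb)] for ra, rb in zip(a, b)]


def mul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(D)) % P for j in range(D)]
            for i in range(D)]


def circle(a, b):
    """The operation a o b = a + b + ab on NT(d, F)."""
    return add(add(a, b), mul(a, b))


def random_nt(rng):
    """Random lower niltriangular matrix: zero on and above the diagonal."""
    return [[rng.randrange(P) if i > j else 0 for j in range(D)]
            for i in range(D)]


def check_isomorphism(trials=50, seed=0):
    """Check (1 + a)(1 + b) = 1 + (a o b) on random samples."""
    rng = random.Random(seed)
    one = identity()
    for _ in range(trials):
        a, b = random_nt(rng), random_nt(rng)
        if mul(add(one, a), add(one, b)) != add(one, circle(a, b)):
            return False
    return True


def is_nilpotent(m):
    """Check M^d = 0 by repeated multiplication."""
    power = m
    for _ in range(D - 1):
        power = mul(power, m)
    return all(entry == 0 for row in power for entry in row)
```

A random check of this kind is only evidence, not a proof; the identity itself follows by expanding $(1 + a)(1 + b)$.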
The second operation is a Lie bracket; it is known that (NT(d, F), $+$, $*$) is a Lie
algebra. In this paper, we will denote this Lie algebra by NT*(d, F). It is not hard
to see, in the light of Equations (3) and (4) later, that this Lie algebra is the same as
the graded Lie algebra of the group UT(d, F).

For $i > j$ and $x \in F$, we define the matrix unit $x e_{i,j}$ to be the $d \times d$
matrix with $x$ in the $(i, j)$ position and $0$ everywhere else.

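The defining relations in these three algebraic objects are the relations in the
field $F$ and the following:

The algebra NT(d, F).

(1) $(x e_{i,j})(y e_{k,l}) = \begin{cases} xy\, e_{i,l} & \text{whenever } j = k \\ 0 & \text{otherwise} \end{cases}$

The group UT(d, F).

(2) $(x e_{i,j}) \circ (y e_{i,j}) = (x + y) e_{i,j}$

(3) $[x e_{i,j}, y e_{k,l}] = \begin{cases} xy\, e_{i,l} & \text{whenever } j = k \\ -xy\, e_{k,j} & \text{whenever } i = l \\ 0 & \text{otherwise} \end{cases}$

The Lie algebra NT*(d, F).

(4) $(x e_{i,j}) * (y e_{k,l}) = \begin{cases} xy\, e_{i,l} & \text{whenever } j = k \\ -xy\, e_{k,j} & \text{whenever } i = l \\ 0 & \text{otherwise} \end{cases}$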
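Relations (1), (3) and (4) can be verified exhaustively on matrix units for a small case. The sketch below assumes $\mathbb{Z}_5$ in place of $F$ and $d = 4$, and takes the group commutator to be $[a, b] = a^{-1} \circ b^{-1} \circ a \circ b$; that convention and all names are assumptions made for illustration.

```python
from itertools import product

P = 5  # assumed prime; Z_P stands in for the field F
D = 4  # matrix size d


def zero():
    return [[0] * D for _ in range(D)]


def unit(i, j, x=1):
    """The matrix unit x e_{i,j}, with 1-indexed i and j."""
    m = zero()
    m[i - 1][j - 1] = x % P
    return m


def add(a, b):
    return [[(s + t) % P for s, t in zip(ra, rb)] for ra, rb in zip(a, b)]


def neg(a):
    return [[(-s) % P for s in row] for row in a]


def mul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(D)) % P for j in range(D)]
            for i in range(D)]


def circle(a, b):
    return add(add(a, b), mul(a, b))


def circle_inverse(a):
    """Inverse under the circle operation: -a + a^2 - a^3 + ... (finite sum)."""
    total, term = zero(), neg(a)
    for _ in range(D - 1):
        total = add(total, term)
        term = mul(term, neg(a))
    return total


def lie_bracket(a, b):
    return add(mul(a, b), neg(mul(b, a)))


def group_commutator(a, b):
    """Assumed convention: [a, b] = a^{-1} o b^{-1} o a o b."""
    return circle(circle(circle_inverse(a), circle_inverse(b)), circle(a, b))


def expected(i, j, x, k, l, y):
    """Right-hand side of Relations (3) and (4)."""
    if j == k:
        return unit(i, l, x * y)
    if i == l:
        return unit(k, j, -x * y)
    return zero()


def check_relations():
    pairs = [(i, j) for i in range(2, D + 1) for j in range(1, i)]
    for (i, j), (k, l) in product(pairs, pairs):
        for x, y in product(range(1, P), repeat=2):
            a, b = unit(i, j, x), unit(k, l, y)
            want = expected(i, j, x, k, l, y)
            product_want = unit(i, l, x * y) if j == k else zero()
            if mul(a, b) != product_want:          # Relation (1)
                return False
            if group_commutator(a, b) != want:     # Relation (3)
                return False
            if lie_bracket(a, b) != want:          # Relation (4)
                return False
    return True
```

The check confirms that, on matrix units, the group commutator and the Lie bracket agree, which is the observation used in the text to transfer generators from the group to the Lie algebra.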

From Relation (3) it follows that a set of generators for UT(d, F) is of
the form $x e_{i+1,i}$, $x \in F$ and $i = 1, 2, \ldots, d - 1$. This is actually a
minimal set of generators. Since the commutator relation and the relation in the
Lie algebra are the same, the same set acts as generators for the Lie algebra as well.

Define

$\Gamma_k = \left\{ m = \sum m_{i,j} e_{i,j} \in \mathrm{UT}(d, F) ;\ m_{i,j} = 0,\ i - j < k \right\}$,

in other words, $\Gamma_1 =$ UT(d, F). The subgroup $\Gamma_2$ is the commutator
subgroup of UT(d, F). It consists of all lower niltriangular matrices with the first
subdiagonal entries zero. The first subdiagonal consists of all entries $m_{i,j}$ with
$i - j = 1$. Similarly $\Gamma_3$ consists of all matrices with the first two subdiagonals
zero and so on. It follows that $\Gamma_d = 0$.

It follows from Relation (3) that if $i - j = k_1$ and $k - l = k_2$ and
$[x e_{i,j}, y e_{k,l}]$ is non-zero, then the commutator is $xy\, e_{i,l}$ or
$-xy\, e_{k,j}$. In both these cases, $i - l$ or $k - j$ equals $k_1 + k_2$. Taking
these into account, one can prove the next proposition.



Proposition 1.1. In UT(d, F), the lower central series and the upper central series
are identical and are of the form
UT(d, F) $= \Gamma_1 > \Gamma_2 > \ldots > \Gamma_{d-1} > \Gamma_d = 0$.
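The index bookkeeping behind the proposition ($i - l$ or $k - j$ equals $k_1 + k_2$) says that a bracket of an element of $\Gamma_{k_1}$ with an element of $\Gamma_{k_2}$ lands in $\Gamma_{k_1 + k_2}$. The sketch below tests this for the Lie bracket $ab - ba$ rather than the group commutator, assuming $\mathbb{Z}_7$ in place of $F$ and $d = 6$; the function names are hypothetical.

```python
import random

P = 7  # assumed prime; Z_P stands in for the field F
D = 6  # matrix size d


def mul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(D)) % P for j in range(D)]
            for i in range(D)]


def bracket(a, b):
    """Lie bracket a * b = ab - ba."""
    ab, ba = mul(a, b), mul(b, a)
    return [[(s - t) % P for s, t in zip(ra, rb)] for ra, rb in zip(ab, ba)]


def random_gamma(k, rng):
    """Random matrix whose entries vanish whenever i - j < k."""
    return [[rng.randrange(P) if i - j >= k else 0 for j in range(D)]
            for i in range(D)]


def in_gamma(m, k):
    return all(m[i][j] == 0
               for i in range(D) for j in range(D) if i - j < k)


def check_filtration(trials=10, seed=0):
    """Check that the bracket of Gamma_{k1} and Gamma_{k2} lies in Gamma_{k1+k2}."""
    rng = random.Random(seed)
    for k1 in range(1, D):
        for k2 in range(1, D):
            for _ in range(trials):
                a, b = random_gamma(k1, rng), random_gamma(k2, rng)
                if not in_gamma(bracket(a, b), k1 + k2):
                    return False
    return True
```

When $k_1 + k_2 \geq d$ the membership test forces the bracket to be the zero matrix, which matches $\Gamma_d = 0$.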
There is an interesting and useful connection between the normal subgroups of
UT(d, F) and ideals of NT*(d, F). The connection can be motivated by a simple
observation: let $1 + L$ be in UT(d, F), i.e., $L \in$ NT(d, F). Then

(5) $(1 + e_{i,j})^{-1} (1 + L) (1 + e_{i,j}) = 1 + L + (L * e_{i,j})$,

which implies that under suitable conditions, elements in a normal subgroup of
UT(d, F) are closed under the Lie bracket. Conversely, under suitable conditions, an
ideal of NT*(d, F) is a normal subgroup of UT(d, F).
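Equation (5) relies on $(1 + e_{i,j})^{-1} = 1 - e_{i,j}$ and on the vanishing of $e_{i,j} L e_{i,j}$ for niltriangular $L$. A short numerical check, assuming $\mathbb{Z}_{11}$ in place of $F$ and $d = 5$ (illustrative choices, with hypothetical helper names):

```python
import random

P = 11  # assumed prime; Z_P stands in for the field F
D = 5   # matrix size d


def identity():
    return [[int(i == j) for j in range(D)] for i in range(D)]


def add(a, b):
    return [[(s + t) % P for s, t in zip(ra, rb)] for ra, rb in zip(a, b)]


def sub(a, b):
    return [[(s - t) % P for s, t in zip(ra, rb)] for ra, rb in zip(a, b)]


def mul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(D)) % P for j in range(D)]
            for i in range(D)]


def unit(i, j):
    """The matrix unit e_{i,j}, with 1-indexed i and j."""
    m = [[0] * D for _ in range(D)]
    m[i - 1][j - 1] = 1
    return m


def random_nt(rng):
    return [[rng.randrange(P) if r > c else 0 for c in range(D)]
            for r in range(D)]


def check_conjugation(trials=20, seed=0):
    """Check (1 + e)^{-1} (1 + L) (1 + e) = 1 + L + (L * e) for every e = e_{i,j}."""
    rng = random.Random(seed)
    one = identity()
    for i in range(2, D + 1):
        for j in range(1, i):
            e = unit(i, j)
            inverse = sub(one, e)  # (1 + e)^{-1} = 1 - e because e^2 = 0
            for _ in range(trials):
                big_l = random_nt(rng)
                lhs = mul(mul(inverse, add(one, big_l)), add(one, e))
                lie = sub(mul(big_l, e), mul(e, big_l))
                if lhs != add(add(one, big_l), lie):
                    return False
    return True
```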

Furthermore one should also notice that if a subgroup $H$ of UT(d, F) is abelian,
then we have $(1 + L)(1 + M) = (1 + M)(1 + L)$ for $1 + L, 1 + M \in H$, which
implies that $L * M = 0$, i.e., if a subgroup is abelian and an ideal, then that ideal is
abelian as well and vice versa.

Notice that, in the motivation above, we have represented an element of the
group UT(d, F) as $1 + L$, where $L \in$ NT(d, F). This is not necessary; we can use
$L$ and the operation $\circ$. However, since $1 + L$ makes the group operation matrix
multiplication, this makes our motivation transparent. From now on, elements in
UT(d, F) will be represented as elements of NT(d, F), with the operation $\circ$.

For $i > j$, let us define $N_{i,j}$ to be the subset of NT(d, F) with all rows above
the $i$-th row zero and all columns to the right of the $j$-th column zero. It is a
rectangle and Weir [7] calls it a partition subgroup. It is straightforward to see
that $N_{i,j}$ is an abelian (two-sided) ideal of the ring NT(d, F). From this it
follows that $N_{i,j}$ is an abelian ideal of NT*(d, F).
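The two properties of $N_{i,j}$ (products inside it vanish, and it absorbs multiplication by NT(d, F) on both sides) can be spot-checked as follows. This is a sketch under the assumption that $\mathbb{Z}_5$ stands in for $F$ and $d = 5$; the helper names are hypothetical.

```python
import random

P = 5  # assumed prime; Z_P stands in for the field F
D = 5  # matrix size d


def mul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(D)) % P for j in range(D)]
            for i in range(D)]


def random_nt(rng):
    return [[rng.randrange(P) if r > c else 0 for c in range(D)]
            for r in range(D)]


def random_partition(i, j, rng):
    """Random element of N_{i,j}: supported on rows >= i and columns <= j."""
    return [[rng.randrange(P) if (r + 1 >= i and c + 1 <= j) else 0
             for c in range(D)] for r in range(D)]


def in_partition(m, i, j):
    return all(m[r][c] == 0
               for r in range(D) for c in range(D)
               if r + 1 < i or c + 1 > j)


def is_zero(m):
    return all(v == 0 for row in m for v in row)


def check_partition_ideals(trials=10, seed=0):
    rng = random.Random(seed)
    for i in range(2, D + 1):
        for j in range(1, i):
            for _ in range(trials):
                n1 = random_partition(i, j, rng)
                n2 = random_partition(i, j, rng)
                m = random_nt(rng)
                if not is_zero(mul(n1, n2)):        # products vanish
                    return False
                if not in_partition(mul(m, n1), i, j):  # left ideal
                    return False
                if not in_partition(mul(n1, m), i, j):  # right ideal
                    return False
    return True
```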

Lemma 1.2. If $H$ is a maximal abelian normal subgroup of UT(d, F) or a maximal
abelian ideal of NT*(d, F), then

• $H^2 \subseteq H$.

• $\alpha\gamma\beta + \beta\gamma\alpha = 0$ for $\alpha, \beta \in H$ and $\gamma \in$ NT(d, F).

Proof. From maximality, it follows that $H$ contains the annihilator of the ring
NT(d, F), i.e., the subset $\{x \mid xy = 0 = yx, \text{ for all } y \in \mathrm{NT}(d, F)\}$.
We show that $H^2$ is contained in the annihilator. We only work with the associated
group; the proof for the Lie algebra is identical.

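Since $H$ is normal, for any $\alpha, \beta \in H$ and $\gamma = x e_{i,j} \in$ NT(d, F),
$\alpha$ commutes with $(-x e_{i,j}) \circ \beta \circ (x e_{i,j})$. This implies that

$\alpha \circ (\beta + \beta (x e_{i,j}) - (x e_{i,j}) \beta) = (\beta + \beta (x e_{i,j}) - (x e_{i,j}) \beta) \circ \alpha$.

Since $H$ is abelian, $\alpha\beta = \beta\alpha$, and so

$(\alpha\beta (x e_{i,j}) + (x e_{i,j}) \beta\alpha) - (\alpha (x e_{i,j}) \beta + \beta (x e_{i,j}) \alpha) = 0$.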
Now notice that the matrix represented in the first parenthesis has non-zero
elements only on the $i$-th row and the $j$-th column. On the other hand the matrix
represented by the second parenthesis has both the $i$-th row and the $j$-th column
zero. Hence the equality is possible only when both the matrices are zero.

This implies that $\alpha\beta (x e_{i,j}) = 0 = (x e_{i,j}) \beta\alpha$ for $i > j$,
$i = 2, 3, \ldots, d$ and $j = 1, 2, \ldots, d - 1$. It is also clear that
$\alpha\gamma\beta + \beta\gamma\alpha = 0$ for $\gamma = x e_{i,j}$.

Notice that for any $d \times d$ matrix $A$, $A e_{i,j}$ is the matrix with only the
$j$-th column non-zero, and its contents are the contents of the $i$-th column of $A$.
Thus $\alpha\beta (x e_{i,j}) = 0$ implies that the $i$-th column of $\alpha\beta$ is zero
for $i = 2, 3, \ldots, d$. Similarly one can show that the $j$-th row is zero for
$j = 1, 2, \ldots, d - 1$. Then it follows that $H^2 \subseteq \Gamma_{d-1}$.

Since any matrix in NT(d, F) can be written as a linear combination of elementary
matrices $x e_{i,j}$, the proof that $H^2$ is contained in the annihilator is complete.
Furthermore, since $H$ is closed under addition, we have
$\alpha\gamma\beta + \beta\gamma\alpha = 0$ for all $\gamma \in$ NT(d, F). □

Theorem 1.3 (Levchuk, 1976). A maximal abelian normal subgroup of UT(d, F)
is also a maximal abelian ideal of NT*(d, F) and vice versa.

Proof. Let $N$ be a maximal abelian normal subgroup of UT(d, F). Then construct
the subgroup

$N' = \langle N \cup \{\alpha - \beta\},\ \alpha, \beta \in N \rangle$.

Clearly $N'$ is an abelian subgroup. Since matrix multiplication distributes over
addition, the subgroup $N'$ is normal. Since $N$ is contained in $N'$, the maximality
implies that $N = N'$ and so $N$ contains the sum of any two of its elements.

To show that it is closed under the Lie bracket, notice that $N$ being normal implies
that $(-x e_{i,j}) \circ \alpha \circ (x e_{i,j})$ is in $N$. This implies that
$\alpha + (\alpha * x e_{i,j})$ is in $N$. Since $N$ is closed under addition,
$\alpha * x e_{i,j}$ is in $N$.

The fact that $N$ is an abelian ideal follows from the fact that $N$ is an abelian
group.

Conversely, assume that $I$ is a maximal abelian ideal of the Lie ring NT*(d, F).
Then from Lemma 1.2, $I^2 \subseteq I$. This proves that $I$ is a subring, which implies
that it is closed under $\circ$. So $I$ is a subgroup, and since $\alpha * x e_{i,j} \in I$
for $\alpha \in I$ and $x e_{i,j} \in$ NT(d, F), we have
$\alpha (x e_{i,j}) - (x e_{i,j}) \alpha \in I$. This shows that $I$ is a normal
subgroup. □

2. Maximal abelian ideals of NT*(d, F)

Notice that the centralizer of any set in the ring NT(d, F), the associated group
UT(d, F), and the Lie ring NT*(d, F) is identical. Let us look at the centralizer
of $N_{i,j}$. Notice that, if

$\left( \sum_{m>n} a_{m,n} e_{m,n} \right) N_{i,j} = N_{i,j} \left( \sum_{m>n} a_{m,n} e_{m,n} \right)$,

then the left hand side is a linear combination of the rows of $N_{i,j}$ and the right
hand side is a linear combination of columns of $N_{i,j}$. Since the entries of $N_{i,j}$
are arbitrary field elements, the only way that this is possible is that
$e_{m,n} N_{i,j} = 0$ and $N_{i,j} e_{m,n} = 0$ whenever $a_{m,n} \neq 0$. So to find the
centralizer is to look for $m, n$ with $m > n$, such that
$e_{m,n} N_{i,j} = 0 = N_{i,j} e_{m,n}$. Now it is easy to see that the centralizer of
$N_{i,j}$ is spanned by the $e_{m,n}$ with $m > j$ and $n < i$.

Since $C(N_{i+1,i}) = N_{i+1,i}$, if $N_{i+1,i}$ is properly contained in an abelian
ideal, then that ideal is contained in the centralizer, which is impossible. This
proves that

(6) $N_{i+1,i}$

is a maximal abelian ideal of NT*(d, F) for $i = 1, 2, \ldots, d - 1$. Further notice that

$\Gamma_k = N_{k+1,1} + N_{k+2,2} + \ldots + N_{d,d-k}$;

taking the intersection of the centralizers of these partition subgroups, it is easy to
see that $C(\Gamma_k) = N_{d-k+1,k}$. In particular, if $d$ is even, i.e., $d = 2k$ for some
integer $k$, then $C(\Gamma_k) = N_{k+1,k}$.
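For very small cases the centralizer claims can be confirmed by brute force. The sketch below enumerates all of NT(d, $\mathbb{Z}_2$); the field $\mathbb{Z}_2$ and the sizes $d = 4, 5$ are assumptions chosen only to keep the enumeration tiny (the paper itself assumes $d \geq 5$), and the function names are hypothetical.

```python
from itertools import product


def below_diagonal(d):
    """0-indexed cells strictly below the main diagonal."""
    return [(r, c) for r in range(d) for c in range(r)]


def all_nt(d):
    """Every matrix of NT(d, Z_2), as a list of rows."""
    cells = below_diagonal(d)
    for bits in product((0, 1), repeat=len(cells)):
        m = [[0] * d for _ in range(d)]
        for (r, c), bit in zip(cells, bits):
            m[r][c] = bit
        yield m


def mul(a, b):
    d = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(d)) % 2 for j in range(d)]
            for i in range(d)]


def unit(r, c, d):
    m = [[0] * d for _ in range(d)]
    m[r][c] = 1
    return m


def partition(i, j, d):
    """0-indexed cells of N_{i,j}; i and j are 1-indexed as in the text."""
    return {(r, c) for r in range(i - 1, d) for c in range(j) if r > c}


def gamma(k, d):
    """0-indexed cells of Gamma_k."""
    return {(r, c) for (r, c) in below_diagonal(d) if r - c >= k}


def centralizer_is(gen_cells, expected_cells, d):
    """Is the centralizer of span(gen_cells) exactly the span of expected_cells?"""
    gens = [unit(r, c, d) for (r, c) in gen_cells]
    cent = [m for m in all_nt(d)
            if all(mul(m, g) == mul(g, m) for g in gens)]
    supported = all(m[r][c] == 0 or (r, c) in expected_cells
                    for m in cent for (r, c) in below_diagonal(d))
    return supported and len(cent) == 2 ** len(expected_cells)
```

For example, `centralizer_is(partition(3, 2, 5), partition(3, 2, 5), 5)` tests $C(N_{3,2}) = N_{3,2}$ for $d = 5$, and `centralizer_is(gamma(2, 4), partition(3, 2, 4), 4)` tests $C(\Gamma_2) = N_{3,2}$ for $d = 4$.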

2.1. Are there any other maximal abelian ideals of NT*(d, F)? Let $H$ be a
maximal abelian ideal of NT*(d, F). Following Levchuk, we define $H_{i,j}$ to be
the subset of $F$ whose elements are in the $(i, j)$ position of a matrix belonging to $H$.

Let $m$ be the smallest, and $n$ be the largest integer such that $H_{m,1} \neq 0$ and
$H_{d,n} \neq 0$. Since $H$ is an ideal, for $i > j > 1$ and $i < m$, $H * e_{j,1} \subseteq H$.
This implies that $H_{i,j} e_{i,1} \subseteq H$. Now for $i < m$, $H_{i,1} = 0$, hence
$H_{i,j} = 0$ for $i < m$.

Similarly, one can show that $H_{i,j} = 0$ for $j > n$ by looking at the fact that
$H * e_{d,i} \subseteq H$.

Two things can happen: either $n < m$ or $n \geq m$. In the first case, it is clear that
$H$ is contained in $N_{m,m-1}$ and is thus $N_{m,m-1}$.

If we assume that $n \geq m$, then the description of $H$ is a bit involved. It gives rise
to maximal abelian ideals of exceptional type.

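First notice that if $n = m$, then $H_{m,j} = 0$ for $j > 1$ and $H_{i,n} = 0$ for $i < d$.
This follows from the fact that $H^2 \subseteq \Gamma_{d-1}$ (see Lemma 1.2).

In this case ($n = m$), let $\alpha, \beta \in H$. Then $\alpha * \beta = 0$. If we write
$\alpha = \sum_{i>j} \alpha_{i,j} e_{i,j}$ and $\beta = \sum_{i>j} \beta_{i,j} e_{i,j}$, then
$\alpha_{n,1} \beta_{d,n} - \beta_{n,1} \alpha_{d,n} = 0$. Now notice that, $H$ being closed
under addition, we can assume that $\alpha_{n,1}$ and $\alpha_{d,n}$ are non-zero. This
implies that the maximal abelian ideal $H$ is of the form: for a fixed $c \in F$

(7) $\{ N_{i+1,i-1} + x e_{i,1} + xc\, e_{d,i} ;\ x \in F \}$.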

Now let us consider the case of $n > m$; in this case we first show that $n > m + 1$
is impossible.

For $n > m$, we show that $H_{m,i} = 0$ for $i > 1$. Notice that for $n > m > i > 1$,
$e_{n,m} * (H * e_{i,1}) \subseteq H$. Since $H * H = 0$, this implies that
$H_{d,n} H_{m,i} e_{d,1} = 0$. Since $H_{d,n} \neq 0$, $H_{m,i} = 0$ for $i > 1$.

In a similar way, looking at $e_{d,i} * (H * e_{n,m}) \subseteq H$ shows us that
$H_{i,n} = 0$ for $i < d$.

Then for $n > m + 1$, $(H * e_{n,m+1}) * e_{m+1,m} \subseteq H$. This implies that
$H_{d,n} e_{d,m} \subseteq H$. The fact that $H * H = 0$ gives us that
$H_{d,n} H_{m,1} e_{d,1} = 0$. However this is impossible. Hence $n > m + 1$ is impossible.

Now we show that if $n = m + 1$, then $2F = 0$, i.e., $F$ has characteristic 2. Since
$H_{m,1}$ and $H_{d,m+1}$ are both non-zero and $H$ is closed under addition, there is a
matrix $\alpha \in H$, where $\alpha = \sum_{i>j} \alpha_{i,j} e_{i,j}$, with
$\alpha_{m,1} \neq 0$ and $\alpha_{d,m+1} \neq 0$.

From Lemma 1.2 we know that $2\alpha (x e_{m+1,m}) \alpha = 0$ for any $x \in F$, which says
that $2x (\alpha_{d,m+1} \alpha_{m,1} e_{d,1}) = 0$. Since $x$ is arbitrary, we have $2F = 0$.

From Lemma 1.2, $H^2 \subseteq \Gamma_{d-1}$; this implies $H_{m+1,i} = 0$ for $i > 1$ and
$H_{j,m} = 0$ for $j < d$. Let $\alpha = \sum_{i>j} \alpha_{i,j} e_{i,j}$. Since $H$ is closed
under addition, we may assume that $\alpha_{m,1}$, $\alpha_{m+1,1}$, $\alpha_{d,m}$ and
$\alpha_{d,m+1}$ are all non-zero. Then $(H * e_{m+1,m}) * H = 0$ implies that
$(\alpha * e_{m+1,m}) * \beta = 0$, where $\beta = \sum_{i>j} \beta_{i,j} e_{i,j}$. This is the
same as saying that $\alpha_{d,m+1} \beta_{m,1} + \alpha_{m,1} \beta_{d,m+1} = 0$. This implies
that there is a $c \in F$ such that $\alpha_{d,m+1} = c\, \alpha_{m,1}$.

Then the maximal abelian ideal $H$ is of the form: for a $c \in F$ and
$i = 2, 3, \ldots, d - 2$,

(8) $N_{i+2,i-1} + a e_{i+1,1} + b e_{d,i} + x e_{i,1} + cx\, e_{d,i+1} ;\ a, b, x \in F$.
So by now we have proved a theorem. 

Theorem 2.1 (Weir, 1955; Levchuk 1976). The maximal abelian ideals of the Lie
ring NT*(d, F) are of the following form: (6), (7) and (8) above. The form (8) occurs
only when the field is of characteristic 2.
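As a sanity check on form (7) only, one can verify that the set $N_{i+1,i-1} + x e_{i,1} + xc\, e_{d,i}$ is an abelian Lie ideal; maximality is not tested. Because the set is a linear subspace and the bracket is bilinear, it is enough to test a basis against the matrix units. The sketch assumes $\mathbb{Z}_3$ in place of $F$ and $d = 5$, with hypothetical function names.

```python
P = 3  # assumed prime; Z_P stands in for the field F
D = 5  # matrix size d


def zero():
    return [[0] * D for _ in range(D)]


def unit(i, j, x=1):
    """The matrix unit x e_{i,j}, with 1-indexed i and j."""
    m = zero()
    m[i - 1][j - 1] = x % P
    return m


def add(a, b):
    return [[(s + t) % P for s, t in zip(ra, rb)] for ra, rb in zip(a, b)]


def mul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(D)) % P for j in range(D)]
            for i in range(D)]


def bracket(a, b):
    ab, ba = mul(a, b), mul(b, a)
    return [[(s - t) % P for s, t in zip(ra, rb)] for ra, rb in zip(ab, ba)]


def type7_basis(i, c):
    """Basis of N_{i+1,i-1} together with the vector e_{i,1} + c e_{d,i}."""
    basis = [unit(r, s) for r in range(i + 1, D + 1) for s in range(1, i)]
    basis.append(add(unit(i, 1), unit(D, i, c)))
    return basis


def in_type7(m, i, c):
    """Membership test for the set in (7)."""
    for r in range(1, D + 1):
        for s in range(1, D + 1):
            inside_n = r >= i + 1 and s <= i - 1
            special = (r, s) in ((i, 1), (D, i))
            if m[r - 1][s - 1] != 0 and not (inside_n or special):
                return False
    return m[D - 1][i - 1] == (c * m[i - 1][0]) % P


def is_abelian_ideal(i, c):
    basis = type7_basis(i, c)
    units = [unit(r, s) for r in range(2, D + 1) for s in range(1, r)]
    abelian = all(bracket(a, b) == zero() for a in basis for b in basis)
    ideal = all(in_type7(bracket(b, e), i, c) for b in basis for e in units)
    return abelian and ideal
```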

3. The automorphism group of UT(d, F)

In this section we describe all the automorphisms of the group UT(d, F). The
automorphisms are as follows:

Extremal automorphisms - $\mathrm{Aut}_E$: These automorphisms arise from the maximal
abelian ideals of exceptional type. As we saw, the maximal abelian
ideals of exceptional type are different for a field of characteristic 2. So
we will have two different types of automorphisms, one for even characteristic
and the other for fields of odd characteristic.
Odd Characteristic:

(9) $x e_{2,1} \mapsto x e_{2,1} + a x\, e_{d,2} + x^{\lambda} e_{d,1}$

where $\lambda : F \to F$ is a map that satisfies the equation
$(x + y)^{\lambda} - x^{\lambda} - y^{\lambda} = axy$, so that
$a = 2^{\lambda} - 2(1^{\lambda})$. All other generators remain fixed.
Similarly, one can define
$x e_{d,d-1} \mapsto x e_{d,d-1} + a x\, e_{d-1,1} + x^{\lambda} e_{d,1}$. All
other generators remain fixed and $\lambda$ satisfies the above relations.
If $F$ is of even characteristic, then $a = 0^{\lambda}$. It is easy to see that,
since $0^{\lambda} = 0$, $a = 0$. So when the characteristic of the field
is even, the extremal automorphisms become central automorphisms.

Clearly automorphisms of this form generate a subgroup of the full
automorphism group of UT(d, F), denoted by $\mathrm{Aut}_E$, which is isomorphic
to $F^+ \oplus F^+$.
Even Characteristic:

(10) $x e_{2,1} \mapsto x e_{2,1} + a x\, e_{d,3}$

All other generators remain fixed. Similarly one can define
$x e_{d,d-1} \mapsto x e_{d,d-1} + a x\, e_{d-2,1}$. Again this automorphism group
$\mathrm{Aut}_E$ is isomorphic to $F^+ \oplus F^+$. We will later show that these
automorphisms are only possible when the field $F$ is the field of two elements
$\mathbb{Z}_2$.
Flip automorphism - $\mathrm{Aut}_P$: This automorphism is given by flipping the
matrix about the anti-diagonal and is given by the equation:

(11) $x e_{i,j} \mapsto x e_{d-j+1,d-i+1}$

This is clearly an automorphism of order 2; the subgroup of the automorphism
group that it generates will be denoted by $\mathrm{Aut}_P$.

Diagonal automorphisms - $\mathrm{Aut}_D$: This automorphism is conjugation by a
diagonal matrix. Diagonal matrices are defined as matrices with
non-zero terms only on the main diagonal and everything else zero. Let
$D = [d_1, d_2, \ldots, d_d]$ be a diagonal matrix, with $d_1, d_2, \ldots, d_d$ as the
non-zero diagonal entries in the respective rows. Then
$D^{-1} (x e_{i,j}) D = d_i^{-1} x d_j\, e_{i,j}$. So a diagonal matrix maps
$x e_{i,j} \mapsto d_i^{-1} x d_j\, e_{i,j}$. The kernel of this map is the set of all
scalar matrices, i.e., $d_1 = d_2 = \cdots = d_d$. These automorphisms clearly form a
subgroup of the automorphism group, which is of the form
$F^{\times} \times F^{\times} \times \cdots \times F^{\times}$ ($d - 1$ times), and will
be denoted by $\mathrm{Aut}_D$.

Field automorphisms - $\mathrm{Aut}_F$: These automorphisms can be described as

(12) $x e_{i+1,i} \mapsto x^{\mu} e_{i+1,i}$, $i = 1, 2, \ldots, d - 1$,

where $\mu : F \to F$ is a field automorphism.
Inner automorphisms - $\mathrm{Aut}_I$: This is the well known normal subgroup of
the automorphism group of any non-abelian group, where $x \mapsto g^{-1} x g$ for
some $g \in$ UT(d, F) and $x \in$ UT(d, F).
Central automorphisms - $\mathrm{Aut}_C$: Central automorphisms form the centralizer
of the group of inner automorphisms in the group of automorphisms.
The simplest way to explain them is to "multiply" the generators with an
element of the center. In the case of UT(d, F) it is

(13) $x e_{i+1,i} \mapsto x e_{i+1,i} + x^{\lambda} e_{d,1}$

where $\lambda$ is a linear map of $F^+$ to itself.
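Two of the formulas in this list lend themselves to a direct check. The sketch below assumes the odd prime field $\mathbb{Z}_7$ in place of $F$ and $d = 5$. For the extremal map (9) it uses the hypothetical choice $x^{\lambda} = a x^2 / 2$, which is one map satisfying $(x + y)^{\lambda} - x^{\lambda} - y^{\lambda} = axy$ and is not claimed to be the only one; it then tests that the images of $x e_{2,1}$ multiply like $x e_{2,1}$. For the diagonal automorphisms it tests $D^{-1} (x e_{i,j}) D = d_i^{-1} x d_j\, e_{i,j}$. All names are illustrative.

```python
P = 7  # assumed odd prime; Z_P stands in for the field F
D = 5  # matrix size d
HALF = pow(2, P - 2, P)  # inverse of 2 modulo P, by Fermat's little theorem


def zero():
    return [[0] * D for _ in range(D)]


def unit(i, j, x=1):
    """The matrix unit x e_{i,j}, with 1-indexed i and j."""
    m = zero()
    m[i - 1][j - 1] = x % P
    return m


def add(a, b):
    return [[(s + t) % P for s, t in zip(ra, rb)] for ra, rb in zip(a, b)]


def mul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(D)) % P for j in range(D)]
            for i in range(D)]


def circle(a, b):
    return add(add(a, b), mul(a, b))


def lam(x, a):
    """Hypothetical choice lambda(x) = a x^2 / 2 (an assumption, not from the paper)."""
    return a * x * x * HALF % P


def extremal_image(x, a):
    """Right-hand side of (9) for the generator x e_{2,1}."""
    return add(add(unit(2, 1, x), unit(D, 2, a * x)), unit(D, 1, lam(x, a)))


def check_extremal(a):
    for x in range(P):
        for y in range(P):
            s = (x + y) % P
            if (lam(s, a) - lam(x, a) - lam(y, a) - a * x * y) % P != 0:
                return False
            if circle(extremal_image(x, a), extremal_image(y, a)) != extremal_image(s, a):
                return False
    # a = 2^lambda - 2 (1^lambda)
    return (lam(2, a) - 2 * lam(1, a) - a) % P == 0


def check_diagonal(diag):
    """Check D^{-1} (x e_{i,j}) D = d_i^{-1} x d_j e_{i,j} for non-zero diag entries."""
    dmat = [[diag[r] if r == c else 0 for c in range(D)] for r in range(D)]
    dinv = [[pow(diag[r], P - 2, P) if r == c else 0 for c in range(D)]
            for r in range(D)]
    for i in range(2, D + 1):
        for j in range(1, i):
            for x in range(P):
                scale = pow(diag[i - 1], P - 2, P) * x * diag[j - 1]
                if mul(mul(dinv, unit(i, j, x)), dmat) != unit(i, j, scale):
                    return False
    return True
```

The extremal check only covers the relation $(x e_{2,1}) \circ (y e_{2,1}) = (x + y) e_{2,1}$; it does not by itself show that (9) extends to an automorphism of the whole group.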

3.1. Why are these the only automorphisms of UT(d, F)? We know that any
automorphism $\phi$ of any group maps a maximal abelian normal subgroup to a
maximal abelian normal subgroup. Our first lemma uses that to prove:

Lemma 3.1. Let $\phi$ be an automorphism of UT(d, F). Then $N_{i+1,i}^{\phi}$ is either
$N_{i+1,i}$ or $N_{d-i+1,d-i}$, for $i = 1, 2, \ldots, d - 1$.

Proof. Notice that the centralizer $C(\Gamma_k)$ of $\Gamma_k$, the $k$-th element in the
central series, is characteristic. If $d$ is even, and $d = 2k$ for some $k$, then
$C(\Gamma_k) = N_{k+1,k}$. Hence $N_{k+1,k}^{\phi} = N_{k+1,k}$. Further notice that for
$i < d/2$, $N_{i+1,i} \cap N_{d-i+1,d-i} = N_{d-i+1,i}$ is a characteristic subgroup of
UT(d, F). Since the flip is an automorphism of UT(d, F), any characteristic subgroup
must be symmetric about the second diagonal.

Then $N_{d-i+1,i}$ is the maximal characteristic subgroup of UT(d, F) contained in
both $N_{i+1,i}$ and $N_{d-i+1,d-i}$. This means that it must be contained as a
maximal characteristic subgroup in $N_{i+1,i}^{\phi}$. So $N_{i+1,i}^{\phi}$ has two
choices, $N_{i+1,i}$ or $N_{d-i+1,d-i}$. □

It is important to notice here that $x e_{2,1}$ and $x e_{d,d-1}$ are not only contained
in the maximal abelian normal subgroups $N_{2,1}$ and $N_{d,d-1}$. They are also
contained in the exceptional subgroups. Let us call the maximal abelian normal
subgroup of type (7) (or of type (8), when the characteristic of the field is 2)
containing $x e_{2,1}$ as $A_2$ and the maximal abelian normal subgroup containing
$x e_{d,d-1}$ as $A_{d-1}$.

It is clear from the last lemma that $A_2^{\phi}$ is either $A_2$ or $A_{d-1}$. Then,
composing with the flip automorphism if necessary, we claim that each $N_{i+1,i}$ of
UT(d, F) is invariant under $\phi$. We can actually say more:
$(x e_{i+1,i})^{\phi} = x^{\lambda_i} e_{i+1,i} \bmod \Gamma_2$, for
$i = 1, 2, \ldots, d - 1$, where $\lambda_i : F \to F$ is a map. Now let us try to
understand the map $\lambda_i$. Since $\phi$ is an automorphism, each map $\lambda_i$ is
a bijection.

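Now recall the relations in UT(d, F) (Relations (2) and (3)). It follows from the
relation $x e_{i,j} \circ y e_{i,j} = (x + y) e_{i,j}$ that if
$x e_{i+1,i} \mapsto x^{\lambda_i} e_{i+1,i}$, then $\lambda_i$ is a linear map of $F^+$.

Furthermore, since $[x e_{i+1,i}, y e_{i,i-1}] = [y e_{i+1,i}, x e_{i,i-1}]$, we have
$x^{\lambda_i} y^{\lambda_{i-1}} = y^{\lambda_i} x^{\lambda_{i-1}}$ for
$i = 2, 3, \ldots, d - 1$. Also, since
$[x e_{2,1}, y e_{3,2}]^{\phi} = [y e_{2,1}, x e_{3,2}]^{\phi}$, we have
$x^{\lambda_1} y^{\lambda_2} = y^{\lambda_1} x^{\lambda_2}$.

Taking all these together, it follows that
$\lambda_1 = k_1 \lambda_2 = k_2 \lambda_3 = \cdots = k_{d-2} \lambda_{d-1}$,
where the $k_i$ are non-zero field elements.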

So now we are in a position to claim that, after composing $\phi$ with a field
automorphism and a diagonal automorphism, $\phi$ acts like the identity on
UT(d, F)$/\Gamma_2$.




As we saw from the above lemma, $\phi$ (after composing with the flip, if necessary)
maps each of $A_2$ and $A_{d-1}$ to itself. So it follows that $(x e_{2,1})^{\phi}$ and
$(x e_{d,d-1})^{\phi}$ are in $A_2$ and $A_{d-1}$ respectively. In the case of odd
characteristic, the description of the extremal automorphism is obvious and is
defined in Equation (9).

In the case of even characteristic we need to say more. Notice that when the
characteristic of the field $F$ is even, the maximal abelian normal subgroup
containing $e_{2,1}$ is

$N_{4,1} + a e_{3,1} + b e_{d,2} + x e_{2,1} + cx\, e_{d,3}$

where $a, b \in F$, $c \in F^{\times}$, $x \in F$. We want to know more about the
automorphism that moves $e_{2,1}$. Using the flip automorphism, if necessary, we can
assume that $A_2^{\phi} = A_2$. So the only choice is

$x e_{2,1} \mapsto x e_{2,1} + a x\, e_{d,3} + x^{\lambda} e_{d,2} + x^{\mu} e_{d,1}$

where $\lambda, \mu : F \to F$ are maps. From Relation (2) we see that
$(x + y)^{\lambda} = x^{\lambda} + y^{\lambda}$. This implies that $0^{\lambda} = 0$. Also
it follows that $(x + y)^{\mu} = x^{\mu} + y^{\mu} + x^{\lambda} y$. This implies that
$(x + y)^{\mu} = x^{\mu} + y^{\mu} + 1^{\lambda} xy$. Putting $x = 0$ and $y = 1$, we
have that $0^{\mu} = 0$. Putting $x = 1$ and $y = 1$ we get $1^{\lambda} = 0$. Since
$x^{\lambda} = 1^{\lambda} x$, this implies that $x^{\lambda} = 0$, i.e., $\lambda$ is the
zero map.

Once $\lambda$ is the zero map, clearly the $e_{d,1}$ entry is not necessary. Hence we
have Equation (10).

From the commutator relations (Relation (3)), we see that

$xy\, e_{3,1} \mapsto xy\, e_{3,1} + a x^2 y\, e_{d,1} + a xy\, e_{d,2}$.

Then interchanging $x$ and $y$, we see that $axy(y - x) = 0$. Since $a \neq 0$, in the
field $F$, for any two distinct elements, one is zero. This means that
$F = \mathbb{Z}_2$.
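The final step is elementary arithmetic: with $a \neq 0$, the condition $xy(y - x) = 0$ must hold for all $x, y$. A small sketch, restricted to prime fields $\mathbb{Z}_p$ (an assumption; the paper's argument covers every field) and using hypothetical function names, shows that only $p = 2$ survives among a few small primes.

```python
def forces_zero_product(p):
    """True if xy(y - x) = 0 for every pair x, y in Z_p."""
    return all((x * y * (y - x)) % p == 0
               for x in range(p) for y in range(p))


def primes_allowing_nonzero_a(primes=(2, 3, 5, 7, 11)):
    """Primes p for which a nonzero a is compatible with axy(y - x) = 0."""
    return [p for p in primes if forces_zero_product(p)]
```

For $p \geq 3$ the pair $x = 1$, $y = 2$ already gives a non-zero product, which is the same obstruction as in the text.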

Notice that for $i = s$, $[x e_{i+1,i}, y e_{s,t}] = xy\, e_{i+1,t}$. So if $s - t = k$,
then $(i + 1) - t = k + 1$. Using this idea one can clear each and every subdiagonal,
one after another, starting with $k = 2$. This means that by suitable conjugation,
$(x e_{i+1,i})^{\phi}$ will have no non-zero entries below the first subdiagonal except
the $(d, 1)$ entry. In the case of $x e_{2,1}$ and $x e_{d,d-1}$ one can actually clear
all such entries, including the $(d, 1)$ entry, using conjugation. One can choose the
conjugators in such a way that this gives rise to an inner automorphism. This proves
the following lemma:

Lemma 3.2. Let $\Gamma_k$ be as defined. For an automorphism $\phi$ of UT(d, F) which
fixes UT(d, F) mod $\Gamma_2$, one can use inner automorphisms such that $\phi$ acts like
the identity modulo $\Gamma_{d-1}$.

So now we have the following: 

• Use the flip automorphism, if necessary, so that $N_{i+1,i}^{\phi} = N_{i+1,i}$ for
$i = 2, 3, \ldots, d - 2$.

• Use extremal automorphisms, if necessary, so that $N_{i+1,i}^{\phi} = N_{i+1,i}$ for
$i = 1, 2, \ldots, d - 1$.

• Use a field automorphism and a diagonal automorphism, if necessary, so
that $(x e_{i+1,i})^{\phi} = x e_{i+1,i} \bmod \Gamma_2$.

• Use inner automorphisms, if necessary, so that
$(x e_{i+1,i})^{\phi} = x e_{i+1,i} \bmod \Gamma_{d-1}$.

• Use central automorphisms, if necessary, so that
$(x e_{i+1,i})^{\phi} = x e_{i+1,i}$. Note that the central automorphisms corresponding
to $x e_{2,1}$ and $x e_{d,d-1}$ are inner automorphisms.

Now we have proved the following theorem. 

Theorem 3.3. The automorphism group of UT(d, F) is generated by extremal
automorphisms, field automorphisms, diagonal automorphisms, inner automorphisms
and central automorphisms.